Overview

Dataset statistics

Number of variables21
Number of observations19695
Missing cells6738
Missing cells (%)1.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 MiB
Average record size in memory168.0 B

Variable types

NUM21

Warnings

sysbp has 206 (1.0%) missing values Missing
wbc has 356 (1.8%) missing values Missing
mcv has 351 (1.8%) missing values Missing
plt has 358 (1.8%) missing values Missing
bun has 207 (1.1%) missing values Missing
glu has 212 (1.1%) missing values Missing
crea has 232 (1.2%) missing values Missing
cho has 214 (1.1%) missing values Missing
tg has 213 (1.1%) missing values Missing
hdl has 206 (1.0%) missing values Missing
ldl has 225 (1.1%) missing values Missing
crp has 207 (1.1%) missing values Missing
hgb has 354 (1.8%) missing values Missing
cysc has 2576 (13.1%) missing values Missing
df_index has unique values Unique

Reproduction

Analysis started2020-09-09 07:37:53.680320
Analysis finished2020-09-09 07:39:29.744960
Duration1 minute and 36.06 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct19695
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9950.990251
Minimum0
Maximum19912
Zeros1
Zeros (%)< 0.1%
Memory size153.9 KiB

Quantile statistics

Minimum0
5-th percentile993.7
Q14978.5
median9942
Q314924.5
95-th percentile18921.3
Maximum19912
Range19912
Interquartile range (IQR)9946

Descriptive statistics

Standard deviation5747.363438
Coefficient of variation (CV)0.5775669851
Kurtosis-1.198499481
Mean9950.990251
Median Absolute Deviation (MAD)4973
Skewness0.001239172482
Sum195984753
Variance33032186.49
MonotocityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
20471< 0.1%
 
129471< 0.1%
 
67901< 0.1%
 
47431< 0.1%
 
190841< 0.1%
 
170371< 0.1%
 
108961< 0.1%
 
88491< 0.1%
 
149941< 0.1%
 
27081< 0.1%
 
Other values (19685)1968599.9%
 
ValueCountFrequency (%) 
01< 0.1%
 
11< 0.1%
 
21< 0.1%
 
31< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
199121< 0.1%
 
199111< 0.1%
 
199101< 0.1%
 
199091< 0.1%
 
199081< 0.1%
 

sysbp
Real number (ℝ≥0)

MISSING

Distinct336
Distinct (%)1.7%
Missing206
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean130.0536627
Minimum90
Maximum193
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum90
5-th percentile101.333336
Q1115
median127.333336
Q3142.33333
95-th percentile170.66667
Maximum193
Range103
Interquartile range (IQR)27.33333

Descriptive statistics

Standard deviation20.56889575
Coefficient of variation (CV)0.1581569893
Kurtosis0.2206368255
Mean130.0536627
Median Absolute Deviation (MAD)13.666664
Skewness0.6718637101
Sum2534615.833
Variance423.0794723
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1251500.8%
 
120.3333361490.8%
 
124.6666641460.7%
 
1201450.7%
 
1241450.7%
 
123.3333361440.7%
 
127.6666641430.7%
 
1211410.7%
 
1291410.7%
 
122.3333361390.7%
 
Other values (326)1804691.6%
 
(Missing)2061.0%
 
ValueCountFrequency (%) 
90550.3%
 
90.3333369< 0.1%
 
90.6666644< 0.1%
 
912< 0.1%
 
91.3333368< 0.1%
 
ValueCountFrequency (%) 
1936< 0.1%
 
1923< 0.1%
 
191.51< 0.1%
 
191510.3%
 
190.666677< 0.1%
 

diabp
Real number (ℝ≥0)

Distinct211
Distinct (%)1.1%
Missing193
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean75.81249787
Minimum50
Maximum112
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum50
5-th percentile58
Q167.666664
median75
Q383.333336
95-th percentile96.5
Maximum112
Range62
Interquartile range (IQR)15.666672

Descriptive statistics

Standard deviation11.61856685
Coefficient of variation (CV)0.1532539776
Kurtosis-0.02358622046
Mean75.81249787
Median Absolute Deviation (MAD)7.666664
Skewness0.3858575459
Sum1478495.333
Variance134.9910956
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
72.6666642511.3%
 
752461.2%
 
75.6666642441.2%
 
712371.2%
 
73.3333362371.2%
 
71.6666642371.2%
 
77.3333362351.2%
 
73.6666642341.2%
 
74.3333362311.2%
 
762291.2%
 
Other values (201)1712186.9%
 
ValueCountFrequency (%) 
50420.2%
 
50.333332100.1%
 
50.666668180.1%
 
51270.1%
 
51.333332490.2%
 
ValueCountFrequency (%) 
1126< 0.1%
 
1111< 0.1%
 
110.53< 0.1%
 
110.333336500.3%
 
1108< 0.1%
 

pulse
Real number (ℝ≥0)

Distinct191
Distinct (%)1.0%
Missing192
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean73.19197902
Minimum50.333332
Maximum105.666664
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum50.333332
5-th percentile57.333332
Q166
median72.333336
Q379.666664
95-th percentile92
Maximum105.666664
Range55.333332
Interquartile range (IQR)13.666664

Descriptive statistics

Standard deviation10.39573775
Coefficient of variation (CV)0.1420338388
Kurtosis0.08417179563
Mean73.19197902
Median Absolute Deviation (MAD)6.666672
Skewness0.4119839858
Sum1427463.167
Variance108.0713633
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
73.3333362741.4%
 
742731.4%
 
712701.4%
 
722611.3%
 
71.3333362611.3%
 
68.6666642601.3%
 
672601.3%
 
74.6666642571.3%
 
702561.3%
 
692561.3%
 
Other values (181)1687585.7%
 
ValueCountFrequency (%) 
50.333332570.3%
 
50.51< 0.1%
 
50.666668180.1%
 
51160.1%
 
51.333332690.4%
 
ValueCountFrequency (%) 
105.666664480.2%
 
105.3333368< 0.1%
 
1055< 0.1%
 
104.666664100.1%
 
104.53< 0.1%
 

wbc
Real number (ℝ≥0)

MISSING

Distinct765
Distinct (%)4.0%
Missing356
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean6.101882672
Minimum2.94
Maximum12.2
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum2.94
5-th percentile3.7
Q14.86
median5.82
Q37.08
95-th percentile9.441
Maximum12.2
Range9.26
Interquartile range (IQR)2.22

Descriptive statistics

Standard deviation1.757569115
Coefficient of variation (CV)0.2880371862
Kurtosis0.8475955069
Mean6.101882672
Median Absolute Deviation (MAD)1.08
Skewness0.8685200037
Sum118004.309
Variance3.089049195
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
5.64032.0%
 
5.33902.0%
 
5.13781.9%
 
5.53631.8%
 
63591.8%
 
5.43581.8%
 
5.83551.8%
 
4.93501.8%
 
53501.8%
 
5.93481.8%
 
Other values (755)1568579.6%
 
(Missing)3561.8%
 
ValueCountFrequency (%) 
2.941030.5%
 
2.991< 0.1%
 
3170.1%
 
3.021< 0.1%
 
3.031< 0.1%
 
ValueCountFrequency (%) 
12.21050.5%
 
12.192< 0.1%
 
12.14< 0.1%
 
12100.1%
 
11.94< 0.1%
 

mcv
Real number (ℝ≥0)

MISSING

Distinct706
Distinct (%)3.6%
Missing351
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean91.11849721
Minimum64.4
Maximum109.8
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum64.4
5-th percentile76
Q187.6
median91.8
Q395.81
95-th percentile102.5
Maximum109.8
Range45.4
Interquartile range (IQR)8.21

Descriptive statistics

Standard deviation7.770628377
Coefficient of variation (CV)0.0852804712
Kurtosis1.659431315
Mean91.11849721
Median Absolute Deviation (MAD)4.1
Skewness-0.816923125
Sum1762596.21
Variance60.38266538
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
912621.3%
 
902281.2%
 
922211.1%
 
952141.1%
 
932141.1%
 
892061.0%
 
942031.0%
 
881951.0%
 
961520.8%
 
871480.8%
 
Other values (696)1730187.8%
 
(Missing)3511.8%
 
ValueCountFrequency (%) 
64.41< 0.1%
 
64.52< 0.1%
 
64.61010.5%
 
64.81< 0.1%
 
64.92< 0.1%
 
ValueCountFrequency (%) 
109.81010.5%
 
109.74< 0.1%
 
109.62< 0.1%
 
109.521< 0.1%
 
109.52< 0.1%
 

plt
Real number (ℝ≥0)

MISSING

Distinct452
Distinct (%)2.3%
Missing358
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean207.3435311
Minimum58
Maximum417
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum58
5-th percentile100
Q1161
median204
Q3250
95-th percentile326
Maximum417
Range359
Interquartile range (IQR)89

Descriptive statistics

Standard deviation68.23984351
Coefficient of variation (CV)0.3291148905
Kurtosis0.193196641
Mean207.3435311
Median Absolute Deviation (MAD)44
Skewness0.3824141484
Sum4009401.86
Variance4656.676242
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
2081370.7%
 
1901370.7%
 
2041340.7%
 
2001330.7%
 
2101310.7%
 
1961290.7%
 
1891240.6%
 
1721240.6%
 
1951240.6%
 
1881240.6%
 
Other values (442)1804091.6%
 
(Missing)3581.8%
 
ValueCountFrequency (%) 
581090.6%
 
599< 0.1%
 
604< 0.1%
 
614< 0.1%
 
627< 0.1%
 
ValueCountFrequency (%) 
4171040.5%
 
4163< 0.1%
 
4151< 0.1%
 
4145< 0.1%
 
4132< 0.1%
 

bun
Real number (ℝ≥0)

MISSING

Distinct823
Distinct (%)4.2%
Missing207
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean15.66301065
Minimum7.282912
Maximum29.46652
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum7.282912
5-th percentile9.523808
Q112.57649
median15.1254
Q318.20728
95-th percentile23.8547165
Maximum29.46652
Range22.183608
Interquartile range (IQR)5.63079

Descriptive statistics

Standard deviation4.364493263
Coefficient of variation (CV)0.2786497028
Kurtosis0.4102920995
Mean15.66301065
Median Absolute Deviation (MAD)2.800472
Skewness0.7125426112
Sum305240.7516
Variance19.04880144
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
14.2857123081.6%
 
13.1652642771.4%
 
14.00562761.4%
 
12.8851522731.4%
 
14.5658242711.4%
 
14.8459362631.3%
 
12.3249282621.3%
 
13.7254882601.3%
 
12.605042561.3%
 
15.6862722551.3%
 
Other values (813)1678785.2%
 
ValueCountFrequency (%) 
7.282912820.4%
 
7.563024310.2%
 
7.674741000.5%
 
7.730763< 0.1%
 
7.758771< 0.1%
 
ValueCountFrequency (%) 
29.46652860.4%
 
29.382491< 0.1%
 
29.354487< 0.1%
 
29.298461< 0.1%
 
29.270451< 0.1%
 

glu
Real number (ℝ≥0)

MISSING

Distinct974
Distinct (%)5.0%
Missing212
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean106.5175814
Minimum63.9
Maximum276.12
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum63.9
5-th percentile79.27928
Q190.9
median99.0991
Q3110.34
95-th percentile160.36037
Maximum276.12
Range212.22
Interquartile range (IQR)19.44

Descriptive statistics

Standard deviation30.90294468
Coefficient of variation (CV)0.2901206006
Kurtosis12.09344631
Mean106.5175814
Median Absolute Deviation (MAD)9.00901
Skewness3.095201261
Sum2075282.038
Variance954.99199
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
91.891896253.2%
 
93.6936955792.9%
 
90.090095762.9%
 
97.2972955502.8%
 
95.49555372.7%
 
88.288295232.7%
 
86.486495052.6%
 
99.09914722.4%
 
84.6846853932.0%
 
100.90093882.0%
 
Other values (964)1433572.8%
 
ValueCountFrequency (%) 
63.9890.5%
 
64.443< 0.1%
 
64.623< 0.1%
 
64.864871050.5%
 
65.162< 0.1%
 
ValueCountFrequency (%) 
276.121000.5%
 
273.422< 0.1%
 
272.71< 0.1%
 
271.981< 0.1%
 
271.263< 0.1%
 

crea
Real number (ℝ≥0)

MISSING

Distinct959
Distinct (%)4.9%
Missing232
Missing (%)1.2%
Infinite0
Infinite (%)0.0%
Mean0.7904956773
Minimum0.4294
Maximum1.82353
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum0.4294
5-th percentile0.5311
Q10.6554
median0.7571
Q30.8902717
95-th percentile1.1413
Maximum1.82353
Range1.39413
Interquartile range (IQR)0.2348717

Descriptive statistics

Standard deviation0.2034374505
Coefficient of variation (CV)0.2573542859
Kurtosis4.676645017
Mean0.7904956773
Median Absolute Deviation (MAD)0.113
Skewness1.553254773
Sum15385.41737
Variance0.04138679628
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.72322951.5%
 
0.70062821.4%
 
0.68932811.4%
 
0.75712771.4%
 
0.71192761.4%
 
0.64412701.4%
 
0.6782651.3%
 
0.76842571.3%
 
0.77972551.3%
 
0.66672541.3%
 
Other values (949)1675185.1%
 
ValueCountFrequency (%) 
0.42941190.6%
 
0.4407190.1%
 
0.452250.1%
 
0.4633260.1%
 
0.4660635890.5%
 
ValueCountFrequency (%) 
1.823531020.5%
 
1.82352982< 0.1%
 
1.8190052< 0.1%
 
1.81787371< 0.1%
 
1.81674251< 0.1%
 

cho
Real number (ℝ≥0)

MISSING

Distinct918
Distinct (%)4.7%
Missing214
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean188.8221074
Minimum111.583
Maximum300.7748
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum111.583
5-th percentile133.97684
Q1163.1452
median185.9546
Q3211.0836
95-th percentile254.3828
Maximum300.7748
Range189.1918
Interquartile range (IQR)47.9384

Descriptive statistics

Standard deviation36.343317
Coefficient of variation (CV)0.1924738448
Kurtosis0.1431174359
Mean188.8221074
Median Absolute Deviation (MAD)23.79243
Skewness0.4677698443
Sum3678443.475
Variance1320.83669
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
300.77481030.5%
 
286.4865990.5%
 
115.98900.5%
 
111.583850.4%
 
190.9804610.3%
 
176.44788600.3%
 
184.4082580.3%
 
199.4856570.3%
 
161.38997570.3%
 
185.7143560.3%
 
Other values (908)1875595.2%
 
(Missing)2141.1%
 
ValueCountFrequency (%) 
111.583850.4%
 
111.5830152< 0.1%
 
111.969128< 0.1%
 
112.355224< 0.1%
 
112.741315< 0.1%
 
ValueCountFrequency (%) 
300.77481030.5%
 
300.38821< 0.1%
 
300.00162< 0.1%
 
299.6151< 0.1%
 
299.22841< 0.1%
 

tg
Real number (ℝ≥0)

MISSING

Distinct947
Distinct (%)4.9%
Missing213
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean136.8644796
Minimum38.055
Maximum540.735
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum38.055
5-th percentile53.1
Q178.765
median110.61947
Q3162.83186
95-th percentile318.599204
Maximum540.735
Range502.68
Interquartile range (IQR)84.06686

Descriptive statistics

Standard deviation88.75168037
Coefficient of variation (CV)0.6484639449
Kurtosis5.197006382
Mean136.8644796
Median Absolute Deviation (MAD)37.17553
Skewness2.093913936
Sum2666393.792
Variance7876.860768
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
5001680.9%
 
38.0551100.6%
 
74.34990.5%
 
540.735950.5%
 
45.13274910.5%
 
99.11504900.5%
 
77.88890.5%
 
79.65880.4%
 
72.57860.4%
 
86.73860.4%
 
Other values (937)1848093.8%
 
(Missing)2131.1%
 
ValueCountFrequency (%) 
38.0551100.6%
 
38.94180.1%
 
39.825170.1%
 
40.71210.1%
 
41.595220.1%
 
ValueCountFrequency (%) 
540.735950.5%
 
539.851< 0.1%
 
538.081< 0.1%
 
533.6551< 0.1%
 
530.1151< 0.1%
 

hdl
Real number (ℝ≥0)

MISSING

Distinct379
Distinct (%)1.9%
Missing206
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean51.11923961
Minimum22.4228
Maximum97.0366
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum22.4228
5-th percentile32.0878
Q141.7528
median49.4848
Q358.68726
95-th percentile75.28957
Maximum97.0366
Range74.6138
Interquartile range (IQR)16.93446

Descriptive statistics

Standard deviation13.13967422
Coefficient of variation (CV)0.2570397041
Kurtosis0.5903548264
Mean51.11923961
Median Absolute Deviation (MAD)8.17206
Skewness0.6560850656
Sum996262.8608
Variance172.6510387
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
48.6486471630.8%
 
49.806951500.8%
 
46.7181471490.8%
 
50.9652521480.8%
 
44.7876431470.7%
 
49.420851460.7%
 
43.2432441440.7%
 
52.509651410.7%
 
42.084941390.7%
 
53.2818531380.7%
 
Other values (369)1802491.5%
 
(Missing)2061.0%
 
ValueCountFrequency (%) 
22.42281060.5%
 
22.80947< 0.1%
 
23.1968< 0.1%
 
23.58269< 0.1%
 
23.96929< 0.1%
 
ValueCountFrequency (%) 
97.0366980.5%
 
96.657< 0.1%
 
96.26342< 0.1%
 
95.87683< 0.1%
 
95.49024< 0.1%
 

ldl
Real number (ℝ≥0)

MISSING

Distinct838
Distinct (%)4.3%
Missing225
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean109.3878884
Minimum37.8868
Maximum210.3104
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum37.8868
5-th percentile61.776062
Q187.25869
median107.0882
Q3128.3512
95-th percentile167.1048485
Maximum210.3104
Range172.4236
Interquartile range (IQR)41.09251

Descriptive statistics

Standard deviation31.82257418
Coefficient of variation (CV)0.290914969
Kurtosis0.2469892626
Mean109.3878884
Median Absolute Deviation (MAD)20.4898
Skewness0.4601529808
Sum2129782.188
Variance1012.676227
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
210.31041070.5%
 
37.88681050.5%
 
180.30891020.5%
 
43.62934860.4%
 
92.27799690.4%
 
113.12741680.3%
 
108.88031660.3%
 
87.64479660.3%
 
98.4556650.3%
 
111.583015650.3%
 
Other values (828)1867194.8%
 
(Missing)2251.1%
 
ValueCountFrequency (%) 
37.88681050.5%
 
38.27341< 0.1%
 
38.662< 0.1%
 
39.04661< 0.1%
 
39.43321< 0.1%
 
ValueCountFrequency (%) 
210.31041070.5%
 
209.92383< 0.1%
 
209.15064< 0.1%
 
208.7641< 0.1%
 
208.37742< 0.1%
 

crp
Real number (ℝ≥0)

MISSING

Distinct1207
Distinct (%)6.2%
Missing207
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean2.545735837
Minimum0.1
Maximum37.02
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum0.1
5-th percentile0.3
Q10.63
median1.21
Q32.49
95-th percentile8.9
Maximum37.02
Range36.92
Interquartile range (IQR)1.86

Descriptive statistics

Standard deviation4.504125058
Coefficient of variation (CV)1.769282182
Kurtosis27.87057685
Mean2.545735837
Median Absolute Deviation (MAD)0.71
Skewness4.8409884
Sum49611.3
Variance20.28714254
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.55232.7%
 
0.65182.6%
 
0.74972.5%
 
0.84812.4%
 
0.94732.4%
 
0.44352.2%
 
13992.0%
 
1.13932.0%
 
1.33781.9%
 
1.23501.8%
 
Other values (1197)1504176.4%
 
ValueCountFrequency (%) 
0.12151.1%
 
0.191110.6%
 
0.21810.9%
 
0.21450.2%
 
0.22370.2%
 
ValueCountFrequency (%) 
37.021000.5%
 
36.71< 0.1%
 
36.351< 0.1%
 
36.22< 0.1%
 
36.11< 0.1%
 

hbalc
Real number (ℝ≥0)

Distinct66
Distinct (%)0.3%
Missing113
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean5.617337351
Minimum4.1
Maximum10.6
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum4.1
5-th percentile4.6
Q15.1
median5.5
Q35.9
95-th percentile7.1
Maximum10.6
Range6.5
Interquartile range (IQR)0.8

Descriptive statistics

Standard deviation0.8904415554
Coefficient of variation (CV)0.1585166601
Kurtosis9.001645311
Mean5.617337351
Median Absolute Deviation (MAD)0.4
Skewness2.399274476
Sum109998.7
Variance0.7928861636
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
5.412746.5%
 
5.612526.4%
 
5.312526.4%
 
5.512256.2%
 
5.212106.1%
 
5.711445.8%
 
5.111245.7%
 
5.811125.6%
 
510815.5%
 
5.99644.9%
 
Other values (56)794440.3%
 
ValueCountFrequency (%) 
4.11160.6%
 
4.2570.3%
 
4.3910.5%
 
4.41720.9%
 
4.52481.3%
 
ValueCountFrequency (%) 
10.61110.6%
 
10.58< 0.1%
 
10.44< 0.1%
 
10.36< 0.1%
 
10.28< 0.1%
 

ua
Real number (ℝ≥0)

Distinct2565
Distinct (%)13.1%
Missing189
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean4.691114677
Minimum2.142
Maximum9
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum2.142
5-th percentile2.8
Q13.71448
median4.50576
Q35.48352
95-th percentile7.2
Maximum9
Range6.858
Interquartile range (IQR)1.76904

Descriptive statistics

Standard deviation1.324802898
Coefficient of variation (CV)0.2824068455
Kurtosis0.3508401796
Mean4.691114677
Median Absolute Deviation (MAD)0.85848
Skewness0.6715059351
Sum91504.88288
Variance1.755102717
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
4.43401.7%
 
4.32991.5%
 
5.12991.5%
 
4.62921.5%
 
4.72881.5%
 
4.52811.4%
 
3.92801.4%
 
4.82751.4%
 
52751.4%
 
42711.4%
 
Other values (2555)1660684.3%
 
ValueCountFrequency (%) 
2.142980.5%
 
2.143682< 0.1%
 
2.145361< 0.1%
 
2.15041< 0.1%
 
2.153761< 0.1%
 
ValueCountFrequency (%) 
91120.6%
 
8.9130.1%
 
8.8170.1%
 
8.78< 0.1%
 
8.6160.1%
 

htc
Real number (ℝ≥0)

Distinct665
Distinct (%)3.4%
Missing134
Missing (%)0.7%
Infinite0
Infinite (%)0.0%
Mean41.41258167
Minimum26.77
Maximum57.6
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum26.77
5-th percentile31.5
Q138
median41.4
Q344.9
95-th percentile50.8
Maximum57.6
Range30.83
Interquartile range (IQR)6.9

Descriptive statistics

Standard deviation5.637021357
Coefficient of variation (CV)0.136118569
Kurtosis0.3229606389
Mean41.41258167
Median Absolute Deviation (MAD)3.5
Skewness0.09707370626
Sum810071.51
Variance31.77600978
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
403051.5%
 
392711.4%
 
432591.3%
 
382491.3%
 
422491.3%
 
30.42411.2%
 
412381.2%
 
442231.1%
 
451750.9%
 
371750.9%
 
Other values (655)1717687.2%
 
ValueCountFrequency (%) 
26.771070.5%
 
26.84< 0.1%
 
26.881< 0.1%
 
26.92< 0.1%
 
26.931< 0.1%
 
ValueCountFrequency (%) 
57.61120.6%
 
57.54< 0.1%
 
57.451< 0.1%
 
57.43< 0.1%
 
57.32< 0.1%
 

hgb
Real number (ℝ≥0)

MISSING

Distinct198
Distinct (%)1.0%
Missing354
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean14.00615356
Minimum8.9
Maximum21.8
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum8.9
5-th percentile11
Q112.7
median13.9
Q315.1
95-th percentile17.3
Maximum21.8
Range12.9
Interquartile range (IQR)2.4

Descriptive statistics

Standard deviation1.969609969
Coefficient of variation (CV)0.1406246162
Kurtosis1.326235474
Mean14.00615356
Median Absolute Deviation (MAD)1.2
Skewness0.5131756382
Sum270893.016
Variance3.879363429
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
13.74642.4%
 
13.14522.3%
 
13.94472.3%
 
13.64432.2%
 
14.14372.2%
 
13.44302.2%
 
13.84302.2%
 
14.24252.2%
 
144202.1%
 
13.54172.1%
 
Other values (188)1497676.0%
 
ValueCountFrequency (%) 
8.9910.5%
 
9120.1%
 
9.15< 0.1%
 
9.2110.1%
 
9.31110.6%
 
ValueCountFrequency (%) 
21.8960.5%
 
21.76< 0.1%
 
21.63< 0.1%
 
21.56< 0.1%
 
21.42< 0.1%
 

cysc
Real number (ℝ≥0)

MISSING

Distinct140
Distinct (%)0.8%
Missing2576
Missing (%)13.1%
Infinite0
Infinite (%)0.0%
Mean0.9292896781
Minimum0.5
Maximum1.91
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum0.5
5-th percentile0.62
Q10.78
median0.9
Q31.03
95-th percentile1.35
Maximum1.91
Range1.41
Interquartile range (IQR)0.25

Descriptive statistics

Standard deviation0.2263221804
Coefficient of variation (CV)0.2435431983
Kurtosis2.103175456
Mean0.9292896781
Median Absolute Deviation (MAD)0.13
Skewness1.116043575
Sum15908.51
Variance0.05122172932
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.943862.0%
 
0.953841.9%
 
0.833831.9%
 
0.853801.9%
 
0.93541.8%
 
0.883531.8%
 
0.963521.8%
 
0.873511.8%
 
0.863451.8%
 
0.933451.8%
 
Other values (130)1348668.5%
 
(Missing)257613.1%
 
ValueCountFrequency (%) 
0.5630.3%
 
0.51150.1%
 
0.52280.1%
 
0.53230.1%
 
0.54290.1%
 
ValueCountFrequency (%) 
1.91690.4%
 
1.892< 0.1%
 
1.883< 0.1%
 
1.877< 0.1%
 
1.864< 0.1%
 

age
Real number (ℝ≥0)

Distinct537
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60.74826524
Minimum40
Maximum85
Zeros0
Zeros (%)0.0%
Memory size153.9 KiB

Quantile statistics

Minimum40
5-th percentile47
Q153.416668
median60.166668
Q366.916664
95-th percentile77.166664
Maximum85
Range45
Interquartile range (IQR)13.499996

Descriptive statistics

Standard deviation9.160124547
Coefficient of variation (CV)0.150788249
Kurtosis-0.5620874482
Mean60.74826524
Median Absolute Deviation (MAD)6.75
Skewness0.3124301431
Sum1196437.084
Variance83.90788172
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
64.583336960.5%
 
58.583332930.5%
 
59.416668910.5%
 
60.833332870.4%
 
62.083332860.4%
 
62.583332850.4%
 
60.666668830.4%
 
60.583332820.4%
 
61.583332810.4%
 
57.916668810.4%
 
Other values (527)1883095.6%
 
ValueCountFrequency (%) 
402< 0.1%
 
40.1666682< 0.1%
 
40.252< 0.1%
 
40.4166682< 0.1%
 
40.51< 0.1%
 
ValueCountFrequency (%) 
853< 0.1%
 
84.9166643< 0.1%
 
84.8333365< 0.1%
 
84.754< 0.1%
 
84.6666642< 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

Sample

First rows

df_indexsysbpdiabppulsewbcmcvpltbunglucreachotghdlldlcrphbalcuahtchgbcyscage
00123.66666467.33333655.3333329.672.9198.019.4389495.940.9831254.382863.72063.4024175.90300.984.63.5952057.620.20.9248.166668
11143.33333074.66666467.6666645.395.4179.012.3524194.140.8814205.671257.52561.4694131.05740.344.95.2466452.017.4NaN59.916668
22178.66667087.00000055.0000007.588.3271.022.09989105.841.1526168.171088.50047.165298.96965.674.84.5595244.514.91.3460.833332
33191.000000106.33333658.3333324.786.1208.015.6295887.480.6554219.975491.15546.0054154.25342.474.63.4473646.016.10.9167.666664
44118.33333654.00000053.6666688.385.6290.013.2207295.760.9040168.9442109.74040.9796103.22220.764.85.2012843.814.61.2379.333336
55100.00000064.33333663.6666684.679.9294.020.0271589.280.6667217.2692102.66049.0982148.84101.174.83.0340841.913.60.9555.833332
66108.00000070.00000053.6666686.389.3228.021.98785103.860.8927209.1506108.85549.8714135.69660.454.63.7800046.916.0NaN59.583332
77108.66666469.00000074.3333368.791.2278.08.3189794.681.0396151.9338110.62543.685884.66540.644.55.7892843.414.60.9145.333332
88166.00000087.66666459.33333211.488.2417.012.88460118.080.5085172.423653.98551.4178100.90262.714.82.3251235.212.50.8664.666664
99124.66666482.33333678.3333367.987.0285.018.3185496.660.6667127.1914132.75056.057040.20640.574.44.9106449.717.30.7457.000000

Last rows

df_indexsysbpdiabppulsewbcmcvpltbunglucreachotghdlldlcrphbalcuahtchgbcyscage
1968519903144.00000082.33333665.0000007.497.1203.015.966384115.3153150.825792179.1505749.5575248.648647112.7413101.06.35.652.616.40.9371.666664
1968619904114.33333670.33333675.3333366.091.2216.012.32492888.2882900.800905233.5907389.3805357.142857158.3011603.26.35.149.615.00.9565.250000
1968719905100.66666464.00000061.0000007.7104.8172.011.48459284.6846850.731901181.46718115.0442549.420850111.1969154.86.15.647.715.30.8576.250000
1968819906108.00000070.66666494.6666646.1106.3218.019.047615106.3063050.466064286.48650500.0000030.115830128.18533025.67.05.455.218.70.9561.083332
1968919907113.00000070.66666483.3333365.893.0221.016.806720122.5225200.642534211.1969190.2654957.915060131.6602300.77.15.442.813.20.9969.583336
1969019908144.33333071.33333667.3333368.891.5220.014.00560090.0900900.756788164.0926781.4159352.12355499.2278000.86.54.449.315.60.8959.916668
1969119909116.66666472.00000063.6666685.991.6224.010.92436886.4864900.466064249.03474201.7699155.598457153.6679501.76.65.147.815.20.7552.416668
1969219910146.33333098.00000079.0000004.8102.2161.015.406160100.9009000.652715229.34363244.2477971.814674123.1660203.65.66.536.712.00.8760.750000
1969319911123.33333677.33333661.3333323.689.4201.012.605040109.9099100.792987191.89189255.7522143.24324493.0501900.85.54.338.513.20.8452.333332
1969419912109.00000071.00000064.6666643.396.9176.016.24649693.6936950.832579209.65251106.1946941.698840145.9459500.86.05.041.313.80.5057.583332